<110> APPLICANT: LIVSHITS, VITALIY
      ZAKATAEVA, NATALIA
      ALESHIN, VLADIMIR
      BELAREVA, ALLA
      TOKHMAKOVA, IRINA
<120> TITLE OF INVENTION: NOVEL GENE AND METHOD FOR PRODUCING L-AMINO ACIDS
<130> FILE REFERENCE: 0010-1070-0
<140> CURRENT APPLICATION NUMBER: 09/466,935
<141> CURRENT FILING DATE: 1999-12-20
<150> PRIOR APPLICATION NUMBER: RU98123511
<151> PRIOR FILING DATE: 1998-12-23
<160> NUMBER OF SEQ ID NOS: 4
<170> SOFTWARE: PatentIn version 3.0
<210> SEQ ID NO: 1
<211> LENGTH: 1231
<212> TYPE: DNA
<213> ORGANISM: Escherichia coli
<220> FEATURE:
<221> NAME/KEY: CDS
<222> LOCATION: (557)..(1171)
<400> SEQUENCE: 1
agaaataatg tggagatcgc accgcccatc gaatgtgcca gtatatagcg tttacgccac     60
ggaccgggct gaacctcctg ctgccagaat gccgccagat catcaacata atcattaaag     120
cgattaacat gcccgagatg cggatcggct aacaggcgac cggaacgtcc ctgcccgcga     180
tggtcgatga ttaagacatc aaaccccaaa tggaacaggt cataggccag ttccgcatat     240
tttacgtagc tctcaatacg ccccgggcag atgactacca cccggtcatg gtgctgtgcg     300
cgaaaacgga caaagcgcac cggaatgtca tccacaccag taaactctgc ttcatcacgc     360
tgacgccaga aatcagtcag cggtcccatg gtaaaagcag caaacgcgtt ttctcttgtt     420
tcccagtctt tttgctgctg aaacatcggg taatctgcct cttaaaccac gtaaaatcgt     480
tttttttagc gtgcctgaca caacgctgcg acagtagcgt attgtggcac aaaaatagac     540
acaccgggag ttcatc atg acc tta gaa tgg tgg ttt gcc tac ctg ctg aca     592
                  Met Thr Leu Glu Trp Trp Phe Ala Tyr Leu Leu Thr
                  1               5                   10


tcg atc att tta acg ctg tcg cca ggc tct ggt gca atc aac act atg       640
Ser Ile Ile Leu Thr Leu Ser Pro Gly Ser Gly Ala Ile Asn Thr Met
        15                  20                  25


acc acc tcg ctc aac cac ggt tat ccg gcc ggt ggc gtc tat tgc tgg       688
Thr Thr Ser Leu Asn His Gly Tyr Pro Ala Gly Gly Val Tyr Cys Trp
    30                  35                  40


gct tca gac cgg act ggc gat tca tat tgt gct ggt tgg cgt ggg gtt       736
Ala Ser Asp Arg Thr Gly Asp Ser Tyr Cys Ala Gly Trp Arg Gly Val
45                  50                  55                  60


ggg acg cta ttt tcc cgc tca gtg att gcg ttt gaa gtg ttg aag tgg       784
Gly Thr Leu Phe Ser Arg Ser Val Ile Ala Phe Glu Val Leu Lys Trp
                65                  70                  75


gca ggc gcg gct tac ttg att tgg ctg gga atc cag cag tgg cgc gcc       832
Ala Gly Ala Ala Tyr Leu Ile Trp Leu Gly Ile Gln Gln Trp Arg Ala
            80                  85                  90

gct ggt gca att gac ctt aaa tcg ctg gcc tct act caa tcg cgt cga       880
Ala Gly Ala Ile Asp Leu Lys Ser Leu Ala Ser Thr Gln Ser Arg Arg
        95                  100                 105

cat ttg ttc cag cgc gca gtt ttt gtg aat ctc acc aat ccc aaa agt       928
His Leu Phe Gln Arg Ala Val Phe Val Asn Leu Thr Asn Pro Lys Ser
    110                 115                 120

att gtg ttt ctg gcg gcg cta ttt ccg caa ttc atc atg ccg caa cag       976
Ile Val Phe Leu Ala Ala Leu Phe Pro Gln Phe Ile Met Pro Gln Gln
125                 130                 135                 140

ccg caa ctg atg cag tat atc gtg ctc ggc gtc acc act att gtg gtc       1024
Pro Gln Leu Met Gln Tyr Ile Val Leu Gly Val Thr Thr Ile Val Val
                145                 150                 155

gat att att gtg atg atc ggt tac gcc acc ctt gct caa cgg att gct       1072
Asp Ile Ile Val Met Ile Gly Tyr Ala Thr Leu Ala Gln Arg Ile Ala
            160                 165                 170

cta tgg att aaa gga cca aag cag atg aag gcg ctg aat aag att ttc       1120
Leu Trp Ile Lys Gly Pro Lys Gln Met Lys Ala Leu Asn Lys Ile Phe
        175                 180                 185

ggc tcg ttg ttt atg ctg gtg gga gcg ctg tta gca tcg gcg agg cat       1168
Gly Ser Leu Phe Met Leu Val Gly Ala Leu Leu Ala Ser Ala Arg His
    190                 195                 200

gcg tgaaaaataa tgtcggatgc ggcgtaaacg ccttatccga cttactctga            1221
Ala
205

agacgcgtct                                                            1231

<210> SEQ ID NO: 2
<211> LENGTH: 205
<212> TYPE: PRT
<213> ORGANISM: Escherichia coli
<400> SEQUENCE: 2

Met Thr Leu Glu Trp Trp Phe Ala Tyr Leu Leu Thr Ser Ile Ile Leu
1               5                   10                  15

Thr Leu Ser Pro Gly Ser Gly Ala Ile Asn Thr Met Thr Thr Ser Leu
            20                  25                  30

Asn His Gly Tyr Pro Ala Gly Gly Val Tyr Cys Trp Ala Ser Asp Arg
        35                  40                  45

Thr Gly Asp Ser Tyr Cys Ala Gly Trp Arg Gly Val Gly Thr Leu Phe
    50                  55                  60

Ser Arg Ser Val Ile Ala Phe Glu Val Leu Lys Trp Ala Gly Ala Ala
65                  70                  75                  80

Tyr Leu Ile Trp Leu Gly Ile Gln Gln Trp Arg Ala Ala Gly Ala Ile
                85                  90                  95

Asp Leu Lys Ser Leu Ala Ser Thr Gln Ser Arg Arg His Leu Phe Gln
            100                 105                 110

Arg Ala Val Phe Val Asn Leu Thr Asn Pro Lys Ser Ile Val Phe Leu
        115                 120                 125

Ala Ala Leu Phe Pro Gln Phe Ile Met Pro Gln Gln Pro Gln Leu Met
    130                 135                 140

Gln Tyr Ile Val Leu Gly Val Thr Thr Ile Val Val Asp Ile Ile Val
145                 150                 155                 160

Met Ile Gly Tyr Ala Thr Leu Ala Gln Arg Ile Ala Leu Trp Ile Lys
                165                 170                 175

Gly Pro Lys Gln Met Lys Ala Leu Asn Lys Ile Phe Gly Ser Leu Phe
            180                 185                 190

Met Leu Val Gly Ala Leu Leu Ala Ser Ala Arg His Ala
        195                 200                 205


<210> SEQ ID NO: 3
<211> LENGTH: 840
<212> TYPE: DNA
<213> ORGANISM: Escherichia coli
<220> FEATURE:
<221> NAME/KEY: CDS
<222> LOCATION: (187)..(804)
<400> SEQUENCE: 3

atgccgatca ccgccagcga aatgctcagc gttaacggcg ttgggatgcg caagctggaa     60
cgctttggca aaccgtttat ggcgctgatt cgtgcgcatg ttgatggcga tgacgaagag     120
tagtcagcag cataaaaaag tgccagtatg aagactccgt aaacgtttcc cccgcgagtc     180
aaatgt atg ttg atg tta ttt ctc acc gtc gcc atg gtg cac att gtg        228
       Met Leu Met Leu Phe Leu Thr Val Ala Met Val His Ile Val
       1               5                   10


gcg ctt atg agc ccc ggt ccc gat ttc ttt ttt gtc tct cag acc gct       276
Ala Leu Met Ser Pro Gly Pro Asp Phe Phe Phe Val Ser Gln Thr Ala
15                  20                  25                  30


gtc agt cgt tcc cgt aaa gaa gcg atg atg ggc gtg ctg ggc att acc       324
Val Ser Arg Ser Arg Lys Glu Ala Met Met Gly Val Leu Gly Ile Thr
                35                  40                  45


tgc ggc gta atg gtt tgg gct ggg att gcg ctg ctt ggc ctg cat ttg       372
Cys Gly Val Met Val Trp Ala Gly Ile Ala Leu Leu Gly Leu His Leu
            50                  55                  60


att atc gaa aaa atg gcc tgg ctg cat acg ctg att atg gtg ggc ggt       420
Ile Ile Glu Lys Met Ala Trp Leu His Thr Leu Ile Met Val Gly Gly
        65                  70                  75


ggc ctg tat ctc tgc tgg atg ggt tac cag atg cta cgt ggt gca ctg       468
Gly Leu Tyr Leu Cys Trp Met Gly Tyr Gln Met Leu Arg Gly Ala Leu
    80                  85                  90


aaa aaa gag gcg gtt tct gca cct gcg cca cag gtc gag ctg gcg aaa       516
Lys Lys Glu Ala Val Ser Ala Pro Ala Pro Gln Val Glu Leu Ala Lys
95                  100                 105                 110


agt ggg cgc agt ttc ctg aaa ggt tta ctg acc aat ctc gct aat ccg       564
Ser Gly Arg Ser Phe Leu Lys Gly Leu Leu Thr Asn Leu Ala Asn Pro
                115                 120                 125


aaa gcg att atc tac ttt ggc tcg gtg ttc tca ttg ttt gtc ggt gat       612
Lys Ala Ile Ile Tyr Phe Gly Ser Val Phe Ser Leu Phe Val Gly Asp
            130                 135                 140


aac gtt ggc act acc gcg cgc tgg ggc att ttt gcg ctg atc att gtc       660
Asn Val Gly Thr Thr Ala Arg Trp Gly Ile Phe Ala Leu Ile Ile Val
        145                 150                 155


gaa acg ctg gcg tgg ttt acc gtc gtt gcc agc ??? ttt gcc ??? ccg       708
Glu Thr Leu Ala Trp Phe Thr Val Val Ala Ser Leu Phe Ala Leu Pro
    160                 165                 170


caa atg cgc cgt ggt tat caa cgt ctg gcg aag tgg att gat aat ttt       756
Gln Met Arg Arg Gly Tyr Gln Arg Leu Ala Lys Trp Ile Asp Gly Phe
175                 180                 185                 190


gcc ggg gcg tta ttt gcc gga ttt ggc att cat tta att att tcg ???       804
Ala Gly Ala Leu Phe Ala Gly Phe Gly Ile His Leu Ile Ile Ser Arg
                195                 200                 205


tgatgccaga cgcgtcttca gagtaagtcg gataag 














<210> SEQ ID NO: 4
<211> LENGTH: 206
<212> TYPE: PRT
<213> ORGANISM: Escherichia coli
<400> SEQUENCE: 4


Met Leu Met Leu Phe Leu Thr Val Ala Met Val His Ile Val Ala Leu
1               5                   10                  15


Met Ser Pro Gly Pro Asp Phe Phe Phe Val Ser Gln Thr Ala Val Ser
            20                  25                  30


Arg Ser Arg Lys Glu Ala Met Met Gly Val Leu Gly Ile Thr Cys Gly
        35                  40                  45


Val Met Val Trp Ala Gly Ile Ala Leu Leu Gly Leu His Leu Ile Ile
    50                  55                  60


Glu Lys Met Ala Trp Leu His Thr Leu Ile Met Val Gly Gly Gly Leu
65                  70                  75                  80


Tyr Leu Cys Trp Met Gly Tyr Gln Met Leu Arg Gly Ala Leu Lys Lys
                85                  90                  95


Glu Ala Val Ser Ala Pro Ala Pro Gln Val Glu Leu Ala Lys Ser Gly
            100                 105                 110


Arg Ser Phe Leu Lys Gly Leu Leu Thr Asn Leu Ala Asn Pro Lys Ala
        115                 120                 125


Ile Ile Tyr Phe Gly Ser Val Phe Ser Leu Phe Val Gly Asp Asn Val
    130                 135                 140


Gly Thr Thr Ala Arg Trp Gly Ile Phe Ala Leu Ile Ile Val Glu Thr
145                 150                 155                 160


Leu Ala Trp Phe Thr Val Val Ala Ser Leu Phe Ala Leu Pro Gln Met
                165                 170                 175


Arg Arg Gly Tyr Gln Arg Leu Ala Lys Trp Ile Asp Gly Phe Ala Gly
            180                 185                 190


Ala Leu Phe Ala Gly Phe Gly Ile His Leu Ile Ile Ser Arg
        195                 200                 205